An Efficient and Performance-Aware Big Data Storage System
Authors
Abstract
Recent growth in Internet development and in the volume of data has created a growing demand for large-capacity storage solutions. Although Cloud storage has yielded new ways of storing, accessing and managing data, there is still a need for an inexpensive, effective and efficient storage solution especially suited to big data management and analysis. In this paper, we take our previous work one step further and present an in-depth analysis of the key features of future big data storage services for both unstructured and semi-structured data, and discuss how such services should be constructed and deployed. We also explain how different technologies can be combined to provide a single, highly scalable, efficient and performance-aware big data storage system. We focus especially on the issues of data de-duplication for enterprises and private organisations. This research is particularly valuable for inexperienced solution providers such as universities and research organisations, and will allow them to swiftly set up their own big data storage services.
Similar resources
E2DR: Energy Efficient Data Replication in Data Grid
Abstract— Data grids are an important branch of grid computing which provide mechanisms for the management of large volumes of distributed data. Energy efficiency has recently emerged as a hot topic in large distributed systems. The development of computing systems is traditionally focused on performance improvements driven by the demand of client applications in scientific and business domai...
Correlation of Big Data with Supply Chain Health Performance in Employees of the Tehran Intelligent Fuel System
Introduction: The dramatic growth of big data and its application in preventing waste of resources and increasing financial performance and supply chain health levels need to be examined from different perspectives. This study aimed to determine the correlation between big data and supply chain health performance among employees of the Tehran Intelligent Fuel System. Methods: In this descriptive cor...
Electrical Energy Storage on the Hybrid Grid of Renewable Energy System Using Fuzzy Controller Optimization Algorithm
The main risks arising from the use of fossil fuels include environmental pollution, the effects of greenhouse gases, climate change and acid rain. For this reason, efficient use of energy in economic development has always been considered an important goal of sustainable development. In this study, the effects of time-varying electricity prices in the energy storage components ...
Big Data Storage Workload Characterization, Modeling and Synthetic Generation By
A huge increase in data storage and processing requirements has led to Big Data, for which next generation storage systems are being designed and implemented. As Big Data stresses the storage layer in new ways, a better understanding of these workloads and the availability of flexible workload generators are increasingly important to facilitate the proper design and performance tuning of stora...
Cloud Computing Technology Algorithms Capabilities in Managing and Processing Big Data in Business Organizations: MapReduce, Hadoop, Parallel Programming
The objective of this study is to verify the importance of the capabilities of cloud computing services in managing and analyzing big data in business organizations. The rapid development in the use of information technology in general, and network technology in particular, has led many organizations to make their applications available for use via electronic platforms hos...